Performance Analysis of the Simplex Method on OpenCL Hardware Accelerators
نویسندگان
چکیده
This work proposes an energy-efficient hardware accelerated Linear Programming (LP) solver. The system is based on the Simplex Algorithm for solving LP problems and is operable on Field Programmable Gate Arrays (FPGAs), Graphic Processing Units (GPUs), and Multi-Core Computer Processors (CPUs). The system is targeted towards the dense problems in radiotherapy applications as they represent a challenge to modern solvers. Performance benchmarking reveals speed ups relative to a sequential implementation that approach 2 and 10 on a CPU and GPU for random, dense problems. The FPGA exhibits unity speed up but proved to be the most efficient in terms of Simplex iterations processed per unit energy with an efficiency 5 times greater than the CPU. This is a notable speed improvement and power saving in comparison with current technology for solving dense problems as the GPU code can solve problems with speeds up to 50 times faster than a sparse solver.
منابع مشابه
OpenCL 2.0 for FPGAs using OCLAcc
Designing hardware is a time-consuming and complex process. Realization of both, embedded and highperformance applications can benefit from a design process on a higher level of abstraction. This helps to reduce development time and allows to iteratively test and optimize the hardware design during development, as common in software development. We present our tool, OCLAcc, which allows the gen...
متن کاملExtending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications
CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host API used to glue the host and kernel parts. Although this model is a clear improvement over a low-level and ad-hoc programming model for each hardware accelerator, it is still too complex and cumberso...
متن کاملSPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance
Hybrid nodes with hardware accelerators are becoming very common in systems today. Users often find it difficult to characterize and understand the performance advantage of such accelerators for their applications. The SPEC High Performance Group (HPG) has developed a set of performance metrics to evaluate the performance and power consumption of accelerators for various science applications. T...
متن کاملTowards Portable Performance for Explicit Hydrodynamics Codes
Significantly increasing intra-node parallelism is widely recognised as being a key prerequisite for reaching exascale levels of computational performance. In future exascale systems it is likely that this performance improvement will be realised by increasing the parallelism available in traditional CPU devices and using massively-parallel hardware accelerators. The MPI programming model is st...
متن کاملOn the Complexity of Robust Source-to-Source Translation from CUDA to OpenCL
The use of hardware accelerators in high-performance computing has grown increasingly prevalent, particularly due to the growth of graphics processing units (GPUs) as generalpurpose (GPGPU) accelerators. Much of this growth has been driven by NVIDIA’s CUDA ecosystem for developing GPGPU applications on NVIDIA hardware. However, with the increasing diversity of GPUs (including those from AMD, AR...
متن کامل